Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update Retryable Exceptions #196

Merged
merged 2 commits into from
Oct 31, 2024
Merged

Update Retryable Exceptions #196

merged 2 commits into from
Oct 31, 2024

Conversation

movchan74
Copy link
Contributor

This pull request includes changes to improve the robustness and reliability of Aana SDK by updating retryable exceptions and preventing overflow in exponential backoff calculations.

Improvements to retryable exceptions:

  • aana/sdk.py: Updated the default list of retryable exceptions to include ActorDiedError and OutOfMemoryError in addition to InferenceException.

Enhancements to exponential backoff:

  • aana/utils/core.py: Modified the sleep_exponential_backoff function to cap the number of attempts at 32 to prevent overflow issues.

@movchan74 movchan74 added bug Something isn't working Reliability Aana is designed to be reliable and robust. It is built to be fault-tolerant and to handle failures labels Oct 31, 2024
@movchan74 movchan74 requested a review from HRashidi October 31, 2024 15:26
@movchan74 movchan74 self-assigned this Oct 31, 2024
@movchan74 movchan74 merged commit 9d4952e into main Oct 31, 2024
6 checks passed
@movchan74 movchan74 deleted the more_retryable_exceptions branch October 31, 2024 15:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working Reliability Aana is designed to be reliable and robust. It is built to be fault-tolerant and to handle failures
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants